Search results for "open data"
showing 10 items of 50 documents
Benchmarking open data efforts through indices and rankings: Assessing development and contexts of use
2022
Abstract This paper aims to provide a broad perspective on the development of benchmarking open data efforts through indices and rankings over the years, both at the level of countries and allowing for a cross-country comparison. The methodology follows a systematic search for the relevant resources, their classification and identification of six open data benchmarks to be further analyzed, the identification of their key components through decomposition, their description, and identifying the similarities and differences. Three major groups of indices and four periods that characterize the efforts to benchmark and measure the development of open data are identified, where the first measure…
Predictive pumping based on sensor data and weather forecast
2019
In energy production, peat extraction has a significant role in Finland. However, protection of nature has become more and more important globally. How do we solve this conflict of interests respecting both views? In peat production, one important phase is to drain peat bog so that peat production becomes available. This means that we have control over how we can lead water away from peat bog to nature without water contamination with solid and other harmful substances. In this paper we describe a novel method how fouling of water bodies from peat bog can be controlled more efficiently by using weather forecast to predict rainfall and thus, minimize the effluents to nature. peerReviewed
Do Dr. Google and Health Apps Have (Comparable) Side Effects? An Experimental Study
2020
Googling and using apps for health-related information are highly prevalent worldwide. So far, little is known about the emotional, body-related, and behavioral effects of using both Google and health-related apps. In our experimental study, bodily symptoms were first provoked by a standardized hyperventilation test. A total of 147 participants (96.6% students) were then randomly assigned to one of three conditions: Googling for the causes of the currently experienced bodily symptoms, using a medical app to diagnose the experienced symptoms, and a waiting control condition. Health-related Internet use for symptoms led to stronger negative affect, increased health anxiety, and increased nee…
openSNP–A Crowdsourced Web Resource for Personal Genomics
2014
Genome-Wide Association Studies are widely used to correlate phenotypic traits with genetic variants. These studies usually compare the genetic variation between two groups to single out certain Single Nucleotide Polymorphisms (SNPs) that are linked to a phenotypic variation in one of the groups. However, it is necessary to have a large enough sample size to find statistically significant correlations. Direct-To-Consumer (DTC) genetic testing can supply additional data: DTC-companies offer the analysis of a large amount of SNPs for an individual at low cost without the need to consult a physician or geneticist. Over 100,000 people have already been genotyped through Direct-To-Consumer genet…
Automating statistical diagrammatic representations with data characterization
2017
The search for an efficient method to enhance data cognition is especially important when managing data from multidimensional databases. Open data policies have dramatically increased not only the volume of data available to the public, but also the need to automate the translation of data into efficient graphical representations. Graphic automation involves producing an algorithm that necessarily contains inputs derived from the type of data. A set of rules are then applied to combine the input variables and produce a graphical representation. Automated systems, however, fail to provide an efficient graphical representation because they only consider either a one-dimensional characterizat…
Visualising maritime vessel open data for better situational awareness in ice conditions
2018
Situational awareness of maritime vessels in ice conditions is important for the operation of supply chains. In the artic sea areas, the ice conditions pose a major challenge for maritime vessels getting stuck in the ice and being significantly delayed in arrival to harbor. Data science and open data provide new opportunities to overcome these challenges. This paper introduces available open data sources and data visualizations that can be used to develop applications, for example, for detecting maritime vessel collision, predicting estimated time of arrival to harbor, as well as maritime vessel route optimization in ice conditions. The paper begins by introducing available open data source…
Big Data in Emergency Management: Exploitation Techniques for Social and Mobile Data
2020
The Internet of Things, crowdsourcing, social media, public authorities, and other sources generate bigger and bigger data sets. Big and open data offers many benefits for emergency management, but also pose new challenges. This chapter will review the sources of big data and their characteristics. We then discuss potential benefits of big data for emergency management along with the technological and societal challenges it poses. We review central technologies for big-data storage and processing in general, before presenting the Spark big-data engine in more detail. Finally, we review ethical and societal threats that big data pose.
A VIRTUAL HUB BROKERING APPROACH FOR INTEGRATION OF HISTORICAL AND MODERN MAPS
2016
Geospatial data are today more and more widespread. Many different institutions, such as Geographical Institutes, Public Administrations, collaborative communities (e.g., OSM) and web companies, make available nowadays a large number of maps. Besides this cartography, projects of digitizing, georeferencing and web publication of historical maps have increasingly spread in the recent years. In spite of these variety and availability of data, information overload makes difficult their discovery and management: without knowing the specific repository where the data are stored, it is difficult to find the information required and problems of interconnection between different data sources and th…
Biased graph walks for RDF graph embeddings
2017
Knowledge Graphs have been recognized as a valuable source for background information in many data mining, information retrieval, natural language processing, and knowledge extraction tasks. However, obtaining a suitable feature vector representation from RDF graphs is a challenging task. In this paper, we extend the RDF2Vec approach, which leverages language modeling techniques for unsupervised feature extraction from sequences of entities. We generate sequences by exploiting local information from graph substructures, harvested by graph walks, and learn latent numerical representations of entities in RDF graphs. We extend the way we compute feature vector representations by comparing twel…
An Extended Data Object-driven Approach to Data Quality Evaluation: Contextual Data Quality Analysis
2019
This research is an extension of a data object-driven approach to data quality evaluation allowing to analyse data object quality in scope of multiple data objects. Previously presented approach was used to analyse one particular data object, mainly focusing on syntactic analysis. It means that the primary data object quality can be analysed against secondary data objects of unlimited number. This opportunity allows making more comprehensive, in-depth contextual data object analysis. The given analysis was applied to open data sets, making comparison between previously obtained results and results of application of the extended approach, underlying importance and benefits of the given exten…